Dataset statistics
| Number of variables | 20 |
|---|---|
| Number of observations | 388 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 60.8 KiB |
| Average record size in memory | 160.3 B |
Variable types
| Categorical | 3 |
|---|---|
| Numeric | 15 |
| Boolean | 2 |
Churn has constant value "1" | Constant |
State has a high cardinality: 51 distinct values | High cardinality |
Total day minutes is highly correlated with Total day charge | High correlation |
Total day charge is highly correlated with Total day minutes | High correlation |
Total eve minutes is highly correlated with Total eve charge | High correlation |
Total eve charge is highly correlated with Total eve minutes | High correlation |
Total night minutes is highly correlated with Total night charge | High correlation |
Total night charge is highly correlated with Total night minutes | High correlation |
Total intl minutes is highly correlated with Total intl charge | High correlation |
Total intl charge is highly correlated with Total intl minutes | High correlation |
Area code is highly correlated with Churn | High correlation |
Churn is highly correlated with Area code and 3 other fields | High correlation |
International plan is highly correlated with Churn | High correlation |
State is highly correlated with Churn | High correlation |
Voice mail plan is highly correlated with Churn | High correlation |
Number vmail messages has 323 (83.2%) zeros | Zeros |
Customer service calls has 79 (20.4%) zeros | Zeros |
Reproduction
| Analysis started | 2021-04-10 21:20:26.785164 |
|---|---|
| Analysis finished | 2021-04-10 21:20:49.238582 |
| Duration | 22.45 seconds |
| Software version | pandas-profiling v2.11.0 |
| Download configuration | config.yaml |
| Distinct | 51 |
|---|---|
| Distinct (%) | 13.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.2 KiB |
| TX | 16 |
|---|---|
| NJ | 14 |
| MD | 14 |
| MI | 13 |
| NV | 13 |
| Other values (46) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 776 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CO |
|---|---|
| 2nd row | AZ |
| 3rd row | MD |
| 4th row | WY |
| 5th row | CO |
| Value | Count | Frequency (%) |
| TX | 16 | 4.1% |
| NJ | 14 | 3.6% |
| MD | 14 | 3.6% |
| MI | 13 | 3.4% |
| NV | 13 | 3.4% |
| MN | 13 | 3.4% |
| NY | 12 | 3.1% |
| AR | 11 | 2.8% |
| CT | 11 | 2.8% |
| ME | 11 | 2.8% |
| Other values (41) | 260 |
| Value | Count | Frequency (%) |
| tx | 16 | 4.1% |
| md | 14 | 3.6% |
| nj | 14 | 3.6% |
| nv | 13 | 3.4% |
| mi | 13 | 3.4% |
| mn | 13 | 3.4% |
| ny | 12 | 3.1% |
| sc | 11 | 2.8% |
| ar | 11 | 2.8% |
| ms | 11 | 2.8% |
| Other values (41) | 260 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 93 | |
| M | 89 | 11.5% |
| A | 73 | 9.4% |
| T | 56 | 7.2% |
| C | 48 | 6.2% |
| D | 42 | 5.4% |
| I | 40 | 5.2% |
| S | 38 | 4.9% |
| O | 36 | 4.6% |
| V | 30 | 3.9% |
| Other values (14) | 231 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 776 |
Most frequent character per category
| Value | Count | Frequency (%) |
| N | 93 | |
| M | 89 | 11.5% |
| A | 73 | 9.4% |
| T | 56 | 7.2% |
| C | 48 | 6.2% |
| D | 42 | 5.4% |
| I | 40 | 5.2% |
| S | 38 | 4.9% |
| O | 36 | 4.6% |
| V | 30 | 3.9% |
| Other values (14) | 231 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 776 |
Most frequent character per script
| Value | Count | Frequency (%) |
| N | 93 | |
| M | 89 | 11.5% |
| A | 73 | 9.4% |
| T | 56 | 7.2% |
| C | 48 | 6.2% |
| D | 42 | 5.4% |
| I | 40 | 5.2% |
| S | 38 | 4.9% |
| O | 36 | 4.6% |
| V | 30 | 3.9% |
| Other values (14) | 231 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 776 |
Most frequent character per block
| Value | Count | Frequency (%) |
| N | 93 | |
| M | 89 | 11.5% |
| A | 73 | 9.4% |
| T | 56 | 7.2% |
| C | 48 | 6.2% |
| D | 42 | 5.4% |
| I | 40 | 5.2% |
| S | 38 | 4.9% |
| O | 36 | 4.6% |
| V | 30 | 3.9% |
| Other values (14) | 231 |
Account length
Real number (ℝ≥0)
| Distinct | 154 |
|---|---|
| Distinct (%) | 39.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 102.3195876 |
|---|---|
| Minimum | 1 |
| Maximum | 225 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 36.35 |
| Q1 | 75.75 |
| median | 103 |
| Q3 | 127 |
| 95-th percentile | 170 |
| Maximum | 225 |
| Range | 224 |
| Interquartile range (IQR) | 51.25 |
Descriptive statistics
| Standard deviation | 40.18459895 |
|---|---|
| Coefficient of variation (CV) | 0.3927361308 |
| Kurtosis | -0.02151221649 |
| Mean | 102.3195876 |
| Median Absolute Deviation (MAD) | 27 |
| Skewness | 0.09201571449 |
| Sum | 39700 |
| Variance | 1614.801993 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 115 | 7 | 1.8% |
| 93 | 7 | 1.8% |
| 113 | 7 | 1.8% |
| 88 | 7 | 1.8% |
| 76 | 7 | 1.8% |
| 105 | 6 | 1.5% |
| 97 | 6 | 1.5% |
| 98 | 6 | 1.5% |
| 108 | 6 | 1.5% |
| 119 | 5 | 1.3% |
| Other values (144) | 324 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 12 | 1 | |
| 13 | 1 | |
| 16 | 1 |
| Value | Count | Frequency (%) |
| 225 | 1 | |
| 224 | 1 | |
| 212 | 1 | |
| 201 | 1 | |
| 197 | 1 |
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.2 KiB |
| 415 | |
|---|---|
| 510 | |
| 408 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1164 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 408 |
|---|---|
| 2nd row | 408 |
| 3rd row | 408 |
| 4th row | 415 |
| 5th row | 408 |
| Value | Count | Frequency (%) |
| 415 | 195 | |
| 510 | 99 | |
| 408 | 94 |
| Value | Count | Frequency (%) |
| 415 | 195 | |
| 510 | 99 | |
| 408 | 94 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 294 | |
| 5 | 294 | |
| 4 | 289 | |
| 0 | 193 | |
| 8 | 94 | 8.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1164 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 1 | 294 | |
| 5 | 294 | |
| 4 | 289 | |
| 0 | 193 | |
| 8 | 94 | 8.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1164 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 1 | 294 | |
| 5 | 294 | |
| 4 | 289 | |
| 0 | 193 | |
| 8 | 94 | 8.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1164 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 1 | 294 | |
| 5 | 294 | |
| 4 | 289 | |
| 0 | 193 | |
| 8 | 94 | 8.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 516.0 B |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 270 | |
| True | 118 |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 516.0 B |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 323 | |
| True | 65 | 16.8% |
| Distinct | 27 |
|---|---|
| Distinct (%) | 7.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.170103093 |
|---|---|
| Minimum | 0 |
| Maximum | 45 |
| Zeros | 323 |
| Zeros (%) | 83.2% |
| Memory size | 3.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 33 |
| Maximum | 45 |
| Range | 45 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 11.87649332 |
|---|---|
| Coefficient of variation (CV) | 2.297148259 |
| Kurtosis | 2.375173698 |
| Mean | 5.170103093 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.002884776 |
| Sum | 2006 |
| Variance | 141.0510935 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 323 | |
| 29 | 6 | 1.5% |
| 33 | 6 | 1.5% |
| 32 | 6 | 1.5% |
| 26 | 5 | 1.3% |
| 42 | 4 | 1.0% |
| 35 | 4 | 1.0% |
| 31 | 4 | 1.0% |
| 28 | 4 | 1.0% |
| 36 | 3 | 0.8% |
| Other values (17) | 23 | 5.9% |
| Value | Count | Frequency (%) |
| 0 | 323 | |
| 16 | 1 | 0.3% |
| 17 | 1 | 0.3% |
| 18 | 2 | 0.5% |
| 19 | 1 | 0.3% |
| Value | Count | Frequency (%) |
| 45 | 1 | 0.3% |
| 44 | 2 | |
| 42 | 4 | |
| 41 | 1 | 0.3% |
| 40 | 1 | 0.3% |
| Distinct | 365 |
|---|---|
| Distinct (%) | 94.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 205.1811856 |
|---|---|
| Minimum | 0 |
| Maximum | 350.8 |
| Zeros | 1 |
| Zeros (%) | 0.3% |
| Memory size | 3.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 97.65 |
| Q1 | 150.9 |
| median | 214.95 |
| Q3 | 262.2 |
| 95-th percentile | 304.68 |
| Maximum | 350.8 |
| Range | 350.8 |
| Interquartile range (IQR) | 111.3 |
Descriptive statistics
| Standard deviation | 68.49021343 |
|---|---|
| Coefficient of variation (CV) | 0.3338035758 |
| Kurtosis | -0.751902766 |
| Mean | 205.1811856 |
| Median Absolute Deviation (MAD) | 55.05 |
| Skewness | -0.1842694097 |
| Sum | 79610.3 |
| Variance | 4690.909335 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 242.2 | 2 | 0.5% |
| 131.6 | 2 | 0.5% |
| 176.9 | 2 | 0.5% |
| 256.4 | 2 | 0.5% |
| 189.1 | 2 | 0.5% |
| 162.3 | 2 | 0.5% |
| 236.9 | 2 | 0.5% |
| 162.1 | 2 | 0.5% |
| 133.3 | 2 | 0.5% |
| 248.7 | 2 | 0.5% |
| Other values (355) | 368 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 46.5 | 1 | |
| 47.7 | 1 | |
| 47.8 | 1 | |
| 54.2 | 1 |
| Value | Count | Frequency (%) |
| 350.8 | 1 | |
| 346.8 | 1 | |
| 345.3 | 1 | |
| 337.4 | 1 | |
| 335.5 | 1 |
Total day calls
Real number (ℝ≥0)
| Distinct | 96 |
|---|---|
| Distinct (%) | 24.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 101.1958763 |
|---|---|
| Minimum | 0 |
| Maximum | 156 |
| Zeros | 1 |
| Zeros (%) | 0.3% |
| Memory size | 3.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 63 |
| Q1 | 87 |
| median | 103 |
| Q3 | 116 |
| 95-th percentile | 134 |
| Maximum | 156 |
| Range | 156 |
| Interquartile range (IQR) | 29 |
Descriptive statistics
| Standard deviation | 21.70527945 |
|---|---|
| Coefficient of variation (CV) | 0.2144877859 |
| Kurtosis | 0.8873766371 |
| Mean | 101.1958763 |
| Median Absolute Deviation (MAD) | 14 |
| Skewness | -0.4495566036 |
| Sum | 39264 |
| Variance | 471.1191561 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 106 | 13 | 3.4% |
| 108 | 12 | 3.1% |
| 112 | 10 | 2.6% |
| 101 | 10 | 2.6% |
| 99 | 9 | 2.3% |
| 103 | 9 | 2.3% |
| 83 | 9 | 2.3% |
| 86 | 9 | 2.3% |
| 120 | 9 | 2.3% |
| 109 | 8 | 2.1% |
| Other values (86) | 290 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 42 | 1 | |
| 44 | 2 | |
| 45 | 1 | |
| 47 | 1 |
| Value | Count | Frequency (%) |
| 156 | 1 | |
| 151 | 2 | |
| 148 | 1 | |
| 147 | 2 | |
| 145 | 1 |
| Distinct | 365 |
|---|---|
| Distinct (%) | 94.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34.88134021 |
|---|---|
| Minimum | 0 |
| Maximum | 59.64 |
| Zeros | 1 |
| Zeros (%) | 0.3% |
| Memory size | 3.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 16.5985 |
| Q1 | 25.6525 |
| median | 36.54 |
| Q3 | 44.5775 |
| 95-th percentile | 51.7965 |
| Maximum | 59.64 |
| Range | 59.64 |
| Interquartile range (IQR) | 18.925 |
Descriptive statistics
| Standard deviation | 11.64347874 |
|---|---|
| Coefficient of variation (CV) | 0.333802505 |
| Kurtosis | -0.7518204735 |
| Mean | 34.88134021 |
| Median Absolute Deviation (MAD) | 9.36 |
| Skewness | -0.1842373786 |
| Sum | 13533.96 |
| Variance | 135.5705972 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 41.68 | 2 | 0.5% |
| 48.69 | 2 | 0.5% |
| 24.65 | 2 | 0.5% |
| 46.09 | 2 | 0.5% |
| 28.41 | 2 | 0.5% |
| 27.59 | 2 | 0.5% |
| 22.66 | 2 | 0.5% |
| 43.59 | 2 | 0.5% |
| 22.37 | 2 | 0.5% |
| 46.36 | 2 | 0.5% |
| Other values (355) | 368 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 7.91 | 1 | |
| 8.11 | 1 | |
| 8.13 | 1 | |
| 9.21 | 1 |
| Value | Count | Frequency (%) |
| 59.64 | 1 | |
| 58.96 | 1 | |
| 58.7 | 1 | |
| 57.36 | 1 | |
| 57.04 | 1 |
| Distinct | 349 |
|---|---|
| Distinct (%) | 89.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 209.3853093 |
|---|---|
| Minimum | 70.9 |
| Maximum | 363.7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 KiB |
Quantile statistics
| Minimum | 70.9 |
|---|---|
| 5-th percentile | 126.675 |
| Q1 | 173.15 |
| median | 209 |
| Q3 | 248.325 |
| 95-th percentile | 287.755 |
| Maximum | 363.7 |
| Range | 292.8 |
| Interquartile range (IQR) | 75.175 |
Descriptive statistics
| Standard deviation | 50.86371836 |
|---|---|
| Coefficient of variation (CV) | 0.2429192312 |
| Kurtosis | -0.09507014465 |
| Mean | 209.3853093 |
| Median Absolute Deviation (MAD) | 37.35 |
| Skewness | 0.03540140643 |
| Sum | 81241.5 |
| Variance | 2587.117846 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 209.4 | 3 | 0.8% |
| 226.1 | 3 | 0.8% |
| 169.9 | 2 | 0.5% |
| 253.4 | 2 | 0.5% |
| 208.9 | 2 | 0.5% |
| 134.1 | 2 | 0.5% |
| 188.8 | 2 | 0.5% |
| 303.4 | 2 | 0.5% |
| 190 | 2 | 0.5% |
| 179.3 | 2 | 0.5% |
| Other values (339) | 366 |
| Value | Count | Frequency (%) |
| 70.9 | 1 | |
| 75.3 | 1 | |
| 77.1 | 1 | |
| 92.3 | 1 | |
| 93.7 | 1 |
| Value | Count | Frequency (%) |
| 363.7 | 1 | |
| 350.9 | 1 | |
| 347.3 | 1 | |
| 339.9 | 1 | |
| 327 | 1 |
Total eve calls
Real number (ℝ≥0)
| Distinct | 87 |
|---|---|
| Distinct (%) | 22.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 99.94845361 |
|---|---|
| Minimum | 48 |
| Maximum | 159 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 KiB |
Quantile statistics
| Minimum | 48 |
|---|---|
| 5-th percentile | 67 |
| Q1 | 86 |
| median | 100.5 |
| Q3 | 113 |
| 95-th percentile | 132 |
| Maximum | 159 |
| Range | 111 |
| Interquartile range (IQR) | 27 |
Descriptive statistics
| Standard deviation | 19.60547365 |
|---|---|
| Coefficient of variation (CV) | 0.1961558478 |
| Kurtosis | -0.2661983882 |
| Mean | 99.94845361 |
| Median Absolute Deviation (MAD) | 13.5 |
| Skewness | -0.06270721028 |
| Sum | 38780 |
| Variance | 384.3745971 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 94 | 12 | 3.1% |
| 102 | 12 | 3.1% |
| 111 | 11 | 2.8% |
| 100 | 9 | 2.3% |
| 108 | 9 | 2.3% |
| 117 | 9 | 2.3% |
| 86 | 9 | 2.3% |
| 92 | 9 | 2.3% |
| 105 | 9 | 2.3% |
| 122 | 8 | 2.1% |
| Other values (77) | 291 |
| Value | Count | Frequency (%) |
| 48 | 2 | |
| 53 | 1 | |
| 54 | 1 | |
| 56 | 2 | |
| 59 | 1 |
| Value | Count | Frequency (%) |
| 159 | 1 | |
| 147 | 1 | |
| 144 | 1 | |
| 143 | 1 | |
| 142 | 1 |
| Distinct | 338 |
|---|---|
| Distinct (%) | 87.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.79786082 |
|---|---|
| Minimum | 6.03 |
| Maximum | 30.91 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 KiB |
Quantile statistics
| Minimum | 6.03 |
|---|---|
| 5-th percentile | 10.767 |
| Q1 | 14.7175 |
| median | 17.765 |
| Q3 | 21.11 |
| 95-th percentile | 24.459 |
| Maximum | 30.91 |
| Range | 24.88 |
| Interquartile range (IQR) | 6.3925 |
Descriptive statistics
| Standard deviation | 4.323326587 |
|---|---|
| Coefficient of variation (CV) | 0.242912709 |
| Kurtosis | -0.09500132295 |
| Mean | 17.79786082 |
| Median Absolute Deviation (MAD) | 3.175 |
| Skewness | 0.03542926839 |
| Sum | 6905.57 |
| Variance | 18.69115278 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 17.71 | 4 | 1.0% |
| 17.8 | 3 | 0.8% |
| 14.2 | 3 | 0.8% |
| 19.22 | 3 | 0.8% |
| 17.22 | 2 | 0.5% |
| 15.56 | 2 | 0.5% |
| 16.15 | 2 | 0.5% |
| 22.07 | 2 | 0.5% |
| 25.79 | 2 | 0.5% |
| 15.24 | 2 | 0.5% |
| Other values (328) | 363 |
| Value | Count | Frequency (%) |
| 6.03 | 1 | |
| 6.4 | 1 | |
| 6.55 | 1 | |
| 7.85 | 1 | |
| 7.96 | 1 |
| Value | Count | Frequency (%) |
| 30.91 | 1 | |
| 29.83 | 1 | |
| 29.52 | 1 | |
| 28.89 | 1 | |
| 27.8 | 1 |
| Distinct | 349 |
|---|---|
| Distinct (%) | 89.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 205.3072165 |
|---|---|
| Minimum | 47.4 |
| Maximum | 354.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 KiB |
Quantile statistics
| Minimum | 47.4 |
|---|---|
| 5-th percentile | 129.25 |
| Q1 | 169.925 |
| median | 204.95 |
| Q3 | 241.15 |
| 95-th percentile | 280.695 |
| Maximum | 354.9 |
| Range | 307.5 |
| Interquartile range (IQR) | 71.225 |
Descriptive statistics
| Standard deviation | 47.56515726 |
|---|---|
| Coefficient of variation (CV) | 0.2316779608 |
| Kurtosis | -0.1882849493 |
| Mean | 205.3072165 |
| Median Absolute Deviation (MAD) | 35.9 |
| Skewness | 0.01552955433 |
| Sum | 79659.2 |
| Variance | 2262.444186 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 178.1 | 3 | 0.8% |
| 227 | 3 | 0.8% |
| 214.2 | 3 | 0.8% |
| 167.8 | 2 | 0.5% |
| 208.9 | 2 | 0.5% |
| 184.2 | 2 | 0.5% |
| 195 | 2 | 0.5% |
| 254.9 | 2 | 0.5% |
| 249.4 | 2 | 0.5% |
| 153.2 | 2 | 0.5% |
| Other values (339) | 365 |
| Value | Count | Frequency (%) |
| 47.4 | 1 | |
| 73.2 | 1 | |
| 87.4 | 1 | |
| 104.9 | 1 | |
| 107.3 | 1 |
| Value | Count | Frequency (%) |
| 354.9 | 1 | |
| 332.7 | 1 | |
| 321.2 | 1 | |
| 309.1 | 1 | |
| 308.9 | 1 |
Total night calls
Real number (ℝ≥0)
| Distinct | 94 |
|---|---|
| Distinct (%) | 24.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 100.6829897 |
|---|---|
| Minimum | 49 |
| Maximum | 158 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 KiB |
Quantile statistics
| Minimum | 49 |
|---|---|
| 5-th percentile | 69.35 |
| Q1 | 85.75 |
| median | 101 |
| Q3 | 116 |
| 95-th percentile | 132 |
| Maximum | 158 |
| Range | 109 |
| Interquartile range (IQR) | 30.25 |
Descriptive statistics
| Standard deviation | 20.07466732 |
|---|---|
| Coefficient of variation (CV) | 0.1993848949 |
| Kurtosis | -0.3220498253 |
| Mean | 100.6829897 |
| Median Absolute Deviation (MAD) | 15 |
| Skewness | 0.04762314947 |
| Sum | 39065 |
| Variance | 402.992268 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 106 | 11 | 2.8% |
| 97 | 10 | 2.6% |
| 84 | 10 | 2.6% |
| 78 | 9 | 2.3% |
| 102 | 9 | 2.3% |
| 104 | 9 | 2.3% |
| 95 | 9 | 2.3% |
| 111 | 9 | 2.3% |
| 118 | 9 | 2.3% |
| 115 | 8 | 2.1% |
| Other values (84) | 295 |
| Value | Count | Frequency (%) |
| 49 | 1 | |
| 51 | 1 | |
| 53 | 1 | |
| 56 | 1 | |
| 57 | 1 |
| Value | Count | Frequency (%) |
| 158 | 1 | |
| 152 | 2 | |
| 151 | 1 | |
| 147 | 1 | |
| 146 | 1 |
| Distinct | 289 |
|---|---|
| Distinct (%) | 74.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.238891753 |
|---|---|
| Minimum | 2.13 |
| Maximum | 15.97 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 KiB |
Quantile statistics
| Minimum | 2.13 |
|---|---|
| 5-th percentile | 5.8175 |
| Q1 | 7.6475 |
| median | 9.225 |
| Q3 | 10.8525 |
| 95-th percentile | 12.633 |
| Maximum | 15.97 |
| Range | 13.84 |
| Interquartile range (IQR) | 3.205 |
Descriptive statistics
| Standard deviation | 2.140617182 |
|---|---|
| Coefficient of variation (CV) | 0.2316963159 |
| Kurtosis | -0.1877677769 |
| Mean | 9.238891753 |
| Median Absolute Deviation (MAD) | 1.615 |
| Skewness | 0.0151953845 |
| Sum | 3584.69 |
| Variance | 4.582241921 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 8.01 | 4 | 1.0% |
| 9.64 | 4 | 1.0% |
| 9.4 | 4 | 1.0% |
| 10.22 | 4 | 1.0% |
| 8.19 | 3 | 0.8% |
| 8.88 | 3 | 0.8% |
| 12.11 | 3 | 0.8% |
| 7.55 | 3 | 0.8% |
| 10.52 | 3 | 0.8% |
| 10.08 | 3 | 0.8% |
| Other values (279) | 354 |
| Value | Count | Frequency (%) |
| 2.13 | 1 | |
| 3.29 | 1 | |
| 3.93 | 1 | |
| 4.72 | 1 | |
| 4.83 | 1 |
| Value | Count | Frequency (%) |
| 15.97 | 1 | |
| 14.97 | 1 | |
| 14.45 | 1 | |
| 13.91 | 1 | |
| 13.9 | 1 |
| Distinct | 111 |
|---|---|
| Distinct (%) | 28.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.8193299 |
|---|---|
| Minimum | 3.9 |
| Maximum | 20 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 KiB |
Quantile statistics
| Minimum | 3.9 |
|---|---|
| 5-th percentile | 6.2 |
| Q1 | 8.9 |
| median | 10.8 |
| Q3 | 12.925 |
| 95-th percentile | 14.8 |
| Maximum | 20 |
| Range | 16.1 |
| Interquartile range (IQR) | 4.025 |
Descriptive statistics
| Standard deviation | 2.771824382 |
|---|---|
| Coefficient of variation (CV) | 0.2561918721 |
| Kurtosis | -0.1462839677 |
| Mean | 10.8193299 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.02455527575 |
| Sum | 4197.9 |
| Variance | 7.683010403 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 13.9 | 11 | 2.8% |
| 11.5 | 9 | 2.3% |
| 11.1 | 9 | 2.3% |
| 11 | 8 | 2.1% |
| 10.1 | 7 | 1.8% |
| 10 | 7 | 1.8% |
| 9.6 | 7 | 1.8% |
| 7.9 | 7 | 1.8% |
| 13.3 | 7 | 1.8% |
| 11.3 | 7 | 1.8% |
| Other values (101) | 309 |
| Value | Count | Frequency (%) |
| 3.9 | 1 | |
| 4.1 | 2 | |
| 4.2 | 1 | |
| 4.5 | 2 | |
| 4.7 | 1 |
| Value | Count | Frequency (%) |
| 20 | 1 | |
| 17.9 | 1 | |
| 17.6 | 1 | |
| 17.5 | 1 | |
| 17.3 | 1 |
Total intl calls
Real number (ℝ≥0)
| Distinct | 16 |
|---|---|
| Distinct (%) | 4.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.051546392 |
|---|---|
| Minimum | 1 |
| Maximum | 20 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 9 |
| Maximum | 20 |
| Range | 19 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.468912173 |
|---|---|
| Coefficient of variation (CV) | 0.6093752692 |
| Kurtosis | 6.582263802 |
| Mean | 4.051546392 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.923546794 |
| Sum | 1572 |
| Variance | 6.095527318 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 86 | |
| 3 | 84 | |
| 4 | 65 | |
| 5 | 42 | |
| 6 | 35 | |
| 1 | 26 | 6.7% |
| 7 | 21 | 5.4% |
| 9 | 10 | 2.6% |
| 8 | 8 | 2.1% |
| 10 | 3 | 0.8% |
| Other values (6) | 8 | 2.1% |
| Value | Count | Frequency (%) |
| 1 | 26 | 6.7% |
| 2 | 86 | |
| 3 | 84 | |
| 4 | 65 | |
| 5 | 42 |
| Value | Count | Frequency (%) |
| 20 | 1 | |
| 15 | 2 | |
| 14 | 1 | |
| 13 | 1 | |
| 12 | 1 |
| Distinct | 111 |
|---|---|
| Distinct (%) | 28.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.921726804 |
|---|---|
| Minimum | 1.05 |
| Maximum | 5.4 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 KiB |
Quantile statistics
| Minimum | 1.05 |
|---|---|
| 5-th percentile | 1.67 |
| Q1 | 2.4 |
| median | 2.92 |
| Q3 | 3.4875 |
| 95-th percentile | 4 |
| Maximum | 5.4 |
| Range | 4.35 |
| Interquartile range (IQR) | 1.0875 |
Descriptive statistics
| Standard deviation | 0.7484308185 |
|---|---|
| Coefficient of variation (CV) | 0.2561604382 |
| Kurtosis | -0.147718281 |
| Mean | 2.921726804 |
| Median Absolute Deviation (MAD) | 0.54 |
| Skewness | 0.02415757327 |
| Sum | 1133.63 |
| Variance | 0.56014869 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 3.75 | 11 | 2.8% |
| 3.11 | 9 | 2.3% |
| 3 | 9 | 2.3% |
| 2.97 | 8 | 2.1% |
| 3.59 | 7 | 1.8% |
| 2.13 | 7 | 1.8% |
| 2.59 | 7 | 1.8% |
| 3.05 | 7 | 1.8% |
| 2.73 | 7 | 1.8% |
| 2.7 | 7 | 1.8% |
| Other values (101) | 309 |
| Value | Count | Frequency (%) |
| 1.05 | 1 | |
| 1.11 | 2 | |
| 1.13 | 1 | |
| 1.22 | 2 | |
| 1.27 | 1 |
| Value | Count | Frequency (%) |
| 5.4 | 1 | |
| 4.83 | 1 | |
| 4.75 | 1 | |
| 4.73 | 1 | |
| 4.67 | 1 |
| Distinct | 10 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.206185567 |
|---|---|
| Minimum | 0 |
| Maximum | 9 |
| Zeros | 79 |
| Zeros (%) | 20.4% |
| Memory size | 3.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 9 |
| Range | 9 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.882535781 |
|---|---|
| Coefficient of variation (CV) | 0.8532989289 |
| Kurtosis | 0.03255737131 |
| Mean | 2.206185567 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.7460275042 |
| Sum | 856 |
| Variance | 3.543940968 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 99 | |
| 0 | 79 | |
| 4 | 64 | |
| 2 | 62 | |
| 3 | 37 | 9.5% |
| 5 | 29 | 7.5% |
| 6 | 10 | 2.6% |
| 7 | 5 | 1.3% |
| 9 | 2 | 0.5% |
| 8 | 1 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 79 | |
| 1 | 99 | |
| 2 | 62 | |
| 3 | 37 | 9.5% |
| 4 | 64 |
| Value | Count | Frequency (%) |
| 9 | 2 | 0.5% |
| 8 | 1 | 0.3% |
| 7 | 5 | 1.3% |
| 6 | 10 | 2.6% |
| 5 | 29 |
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.2 KiB |
| 1 |
|---|
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 388 |
|---|---|
| Distinct characters | 1 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 388 |
| Value | Count | Frequency (%) |
| 1 | 388 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 388 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 388 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 1 | 388 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 388 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 1 | 388 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 388 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 1 | 388 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| State | Account length | Area code | International plan | Voice mail plan | Number vmail messages | Total day minutes | Total day calls | Total day charge | Total eve minutes | Total eve calls | Total eve charge | Total night minutes | Total night calls | Total night charge | Total intl minutes | Total intl calls | Total intl charge | Customer service calls | Churn | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | CO | 77 | 408 | No | No | 0 | 62.4 | 89 | 10.61 | 169.9 | 121 | 14.44 | 209.6 | 64 | 9.43 | 5.7 | 6 | 1.54 | 5 | 1 |
| 1 | AZ | 12 | 408 | No | No | 0 | 249.6 | 118 | 42.43 | 252.4 | 119 | 21.45 | 280.2 | 90 | 12.61 | 11.8 | 3 | 3.19 | 1 | 1 |
| 2 | MD | 135 | 408 | Yes | Yes | 41 | 173.1 | 85 | 29.43 | 203.9 | 107 | 17.33 | 122.2 | 78 | 5.50 | 14.6 | 15 | 3.94 | 0 | 1 |
| 3 | WY | 87 | 415 | No | No | 0 | 151.0 | 83 | 25.67 | 219.7 | 116 | 18.67 | 203.9 | 127 | 9.18 | 9.7 | 3 | 2.62 | 5 | 1 |
| 4 | CO | 121 | 408 | No | Yes | 30 | 198.4 | 129 | 33.73 | 75.3 | 77 | 6.40 | 181.2 | 77 | 8.15 | 5.8 | 3 | 1.57 | 3 | 1 |
| 5 | TX | 150 | 510 | No | No | 0 | 178.9 | 101 | 30.41 | 169.1 | 110 | 14.37 | 148.6 | 100 | 6.69 | 13.8 | 3 | 3.73 | 4 | 1 |
| 6 | DC | 82 | 415 | No | No | 0 | 300.3 | 109 | 51.05 | 181.0 | 100 | 15.39 | 270.1 | 73 | 12.15 | 11.7 | 4 | 3.16 | 0 | 1 |
| 7 | NY | 144 | 408 | No | No | 0 | 61.6 | 117 | 10.47 | 77.1 | 85 | 6.55 | 173.0 | 99 | 7.79 | 8.2 | 7 | 2.21 | 4 | 1 |
| 8 | TX | 106 | 510 | No | No | 0 | 210.6 | 96 | 35.80 | 249.2 | 85 | 21.18 | 191.4 | 88 | 8.61 | 12.4 | 1 | 3.35 | 2 | 1 |
| 9 | IN | 94 | 408 | No | No | 0 | 157.9 | 105 | 26.84 | 155.0 | 101 | 13.18 | 189.6 | 84 | 8.53 | 8.0 | 5 | 2.16 | 4 | 1 |
Last rows
| State | Account length | Area code | International plan | Voice mail plan | Number vmail messages | Total day minutes | Total day calls | Total day charge | Total eve minutes | Total eve calls | Total eve charge | Total night minutes | Total night calls | Total night charge | Total intl minutes | Total intl calls | Total intl charge | Customer service calls | Churn | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 378 | OK | 146 | 510 | No | No | 0 | 138.4 | 104 | 23.53 | 158.9 | 122 | 13.51 | 47.4 | 73 | 2.13 | 3.9 | 9 | 1.05 | 4 | 1 |
| 379 | RI | 138 | 510 | Yes | No | 0 | 286.2 | 61 | 48.65 | 187.2 | 60 | 15.91 | 146.2 | 114 | 6.58 | 11.0 | 4 | 2.97 | 2 | 1 |
| 380 | ID | 82 | 415 | No | No | 0 | 266.9 | 83 | 45.37 | 229.7 | 74 | 19.52 | 251.7 | 99 | 11.33 | 11.0 | 6 | 2.97 | 3 | 1 |
| 381 | AR | 76 | 408 | No | No | 0 | 107.3 | 140 | 18.24 | 238.2 | 133 | 20.25 | 271.8 | 116 | 12.23 | 10.0 | 3 | 2.70 | 4 | 1 |
| 382 | KS | 170 | 415 | No | Yes | 42 | 199.5 | 119 | 33.92 | 135.0 | 90 | 11.48 | 184.6 | 49 | 8.31 | 10.9 | 3 | 2.94 | 4 | 1 |
| 383 | MI | 119 | 510 | Yes | Yes | 22 | 172.1 | 119 | 29.26 | 223.6 | 133 | 19.01 | 150.0 | 94 | 6.75 | 13.9 | 20 | 3.75 | 1 | 1 |
| 384 | IL | 71 | 510 | Yes | No | 0 | 186.1 | 114 | 31.64 | 198.6 | 140 | 16.88 | 206.5 | 80 | 9.29 | 13.8 | 5 | 3.73 | 4 | 1 |
| 385 | GA | 122 | 510 | Yes | No | 0 | 140.0 | 101 | 23.80 | 196.4 | 77 | 16.69 | 120.1 | 133 | 5.40 | 9.7 | 4 | 2.62 | 4 | 1 |
| 386 | MD | 62 | 408 | No | No | 0 | 321.1 | 105 | 54.59 | 265.5 | 122 | 22.57 | 180.5 | 72 | 8.12 | 11.5 | 2 | 3.11 | 4 | 1 |
| 387 | IN | 117 | 415 | No | No | 0 | 118.4 | 126 | 20.13 | 249.3 | 97 | 21.19 | 227.0 | 56 | 10.22 | 13.6 | 3 | 3.67 | 5 | 1 |